On Constraint-Based Lambek Calculi 



Jochen Dorre 

Institut fur Machinelle Sprachverarbeitung (IMS) 
University of Stuttgart, Germany 
email: Jochen.Doerre@ims.uni-stuttgart.de 

and 

Suresh Manandhar 

Language Technology Group 
Human Communication Research Centre 
University of Edinburgh, Scotland 
email: Suresh.Manandhar@ed.ac.uk 



Abstract. We explore the consequences of layering a Lambek proof system over an arbitrary 
(constraint) logic. A simple model-theoretic semantics for our hybrid language is provided for 
which a particularly simple combination of Lambek's and the proof system of the base logic is 
complete. Furthermore the proof system for the underlying base logic can be assumed to be a 
black box. The essential reasoning needed to be performed by the black box is that of entailment 
checking. Assuming feature logic as the base logic entailment checking amounts to a subsumption 
test which is a well-known quasi-linear time decidable problem. 
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1 Background 

In recent years there has been a growing awareness of a need to design grammar log- 
ics that incorporate both the resource-mindedness of categorial grammars and the 
typed constraint-based approach of HPSG [15] [16]. We believe that the long-term 
goal of this enterprise is to provide an incremental and largely deterministic model 
of sentence comprehension within a constraint-based setting — something that 
current HPSG lacks. The constraint-based basis is important, since this provides 
an excellent knowledge representation, engineering and structuring environment 
for NLP. 

Although, there has been a lot of work on finer systems of categorial grammars 
[11] [12] none directly build on a unification based framework. In a system such as 
that of Moortgat's [10] categories can be built using tuples containing type, syntax 
and structure in the spirit of UCG [23]. However, feature descriptions cannot be 
employed for instance to describe the syntax. Thus the consequences of a more 
direct and perhaps more pragmatic approach remains unexplored. In this paper, 
this is precisely what we do. 

UCG [23] and CUG [21] provided the initial integration of such an approach. 
However, both lacked a rigorous model-theory. Even worse, their underlying 
unification-based proof system is incompatible with the straightforward model 
theory of feature-based categorial types (where feature terms take the place of 
basic categories like in (cat: s) / '(cat: np & case: acc)), when making the following 
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two basic assumptions, which we believe are uncontroversially desirable in such a 
theory. 

1. The denotation of complex types should be composed in the same way as 
in standard semantics for (Lambek) categorial systems. No matter whether 
we use string semantics or ternary frame semantics [6], types denote sets of 
objects and a complex type A/B or B\A denotes a left-, resp. right-residual 
w.r.t. a join operation (or relation) 0, i.e., those 'functional' objects which 
when joined with a B object (to their left, resp. right) yield an A object. 

2. Feature terms should induce a subtype ordering on categories in the sense 
that more specific types should denote subsets of less specific ones (e.g. 
\cat: np & case: accj C \cat: npj). 

Now, this implies that, for instance, A/BQB' =>■ A ("when something is the join of 
A/B and B' , then it is of type A v ) if and only if B' is a subtype of B. On the other 
hand this sequent is derived in the unification-based systems whenever B and B' 
unify. Note that there is an essential difference between the two views in that our 
model theory postulates an asymmetry between B and B' i.e. B' is informationally 
more specific than B. In order to refer to one of the two approaches, we will talk 
of subsumption-based vs. unification-based argument binding. 

Here, we explore the consequences of adding a Lambek proof system over an arbi- 
trary (constraint) logic while following the subsumption-based argument binding 
approach, i.e., the one evolving naturally from the model-theoretic view. We 
provide a simple set-theoretic semantics for our hybrid language and show that 
the Lambek proof system carries over to this hybrid logic and furthermore the 
proof system for the underlying base logic can be assumed to be a black box. We 
study primarily the case where the essential reasoning needed to be performed by 
the black box is that of entailment checking. Assuming feature logic as the base 
logic entailment checking amounts to a subsumption test which is a well-known 
quasi-linear time decidable problem [2] [20]. 

Section 2 reviews linguistic motivation in favor of the subsumption-based approach, 
which we borrow from recent work by Bayer and Johnson [3]. The formal frame- 
work for our combined logics will be the (easy) extension of Lambek's calculus 
with subtyping and is presented in Section 3. Next, we define in Section 4 the 
layered Lambek system over arbitrary base logics (for the description of basic 
types) as well as the special case, where feature logic is emplyed. Section 5 closes 
with a discussion on conceivable limitations and the possibility of having both the 
subsumption-based and the unification-based approaches in one system. 

2 Double Coordinations 

Recently, Bayer and Johnson [3] have given an analysis of agreement phenomena in 
coordinations, which strongly supports the view of subsumption-based argument 
binding and which we review here as the primary linguistic motivation for our 
approach. Consider examples la-d. 
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(1) a. Kim [yp [y l> 



>ecame ] [? [^p wealthy ] and [j^p a Republican ]]] 



b. *Kim [yp [y grew ] [? [j^p wealthy ] and [j^p a Republican ]]] 

c. /d. Kim grew [j^p wealthy ] / * [j^p a Republican ] 

Clearly, the contrast between (la) and (lb) should be explained on the basis that 
become admits AP and NP complements, whereas no NP complement is allowed to 
follow grew. Assuming that we do not want to give coordinations a metagrammat- 
ical treatment, but attempt a phrase-structure analysis, then the question arises, 
what category should be given a coordination like in (la). In most 'unification- 
based' approaches to grammar this kind of polymorphic coordination is accounted 
for by requiring that in a coordination the feature structure of the coodination sub- 
sumes (or as a stronger condition: is the generalisation of) all the feature structures 
of the coordinated elements. This is captured by a coordination rule like 



(cf. [18]). Due to the additional assumption that types AP, NP, VP, PP are rep- 
resented with the help of two binary features ±u and ±ra as {+t>, { — v, +n}, 
{+v, —n}, { — v, —n}, respectively, the polymorphic subcategorisation requirement 
of become can be encoded as +n. We now may assume category +ra also for 
the coordination in (la), because this subsumes both AP ({+u, +n}) and NP 
({ — v, +n}). (lb) is correctly ruled out, since grew requires the coordination to be 
{+ra, +f }, and thus it cannot satisfy the subsumption constraint for the NP. 
This approach to coordination, however, does not carry over to 'double coordina- 
tions' considered in [3], where in addition the verbal part consists of a coordination: 

(2) a. * Kim [ grew and remained ] [ wealthy and a Republican ]. 

Assuming a standard encoding of subcategorisation information on the verb the 
coordination rule here would predict a weakening (generalisation) of the subcate- 
gorisation requirements of two conjoined verbs (see Fig. 1). With the above rule 
a phrase "u i and would admit any complement whose type subsumes the type 
that v i selects for as well as the one vi selects for. This fits with the data as long 
as the complement(s) have maximally specific types (i.e. AP, NP, VP, or PP). 
However, the analysis breaks down, if the complement is itself a coordination. For 
such a double coordination construction an analysis using the above coordination 
rule would lead to a complete relaxation of any subcategorisation requirements 
(any combination of verbs can take any combination of phrases as complement), 
since we simply may assume the structure [T] to be [] (empty information), and 
all subsumption and equality constraints of this analysis are trivially met. 
In fact, a construction LL V\ and has to allow for just those complements that 
can follow v i by itself, but also can follow v 2 by itself, imposing the requirements of 
both verbs together. E.g., only an AP complement may follow grew and remained. 
But we also cannot assume the type requirements of the two verbs to simply 
get unified, because they may be inconsistent. At least, this is suggested by the 
following German sentences cited originally by [17] and [9]. 




where Xq C X\ and Xq C X2 
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V: 


+ 




V: - 


N: 


+ 


CONJ 


_N: + 



COMP: [n: +] 

wealthy and a Republican 
grew and remained 

Fig. 1. Overgeneration of the coordination rule in the unification-based setting 



remained [ap] 



vp/npvap npvap wealthy 

grew and vp ap and 

— /ii P 



vp/ap conj 



vp/ap 



npvap con j 



a Republican 
np 
npvap 



vp/ap npvap 
Fig. 2. Partial proof tree showing rejection of (2) in [3] 

(3)a./b./c. Er findet und hilft *Manner /*Kindern / Frauen 

he find-Ace and help-DAT men-Ace children-DAT women- acc+d at 

Bayer and Johnson have a simple and convincing analysis accounting for this phe- 
nomenon. They devise a simple extension to LCG (Lambek Categorial Grammar) 
in which basic types are replaced by propositional formulas built solely from prepo- 
sitional variables, a (conjunction) and v (disjunction). Apart from the original 
LCG rules only the following two rule schemata are needed: 1 



if <f> h ip in propo- 
sitional logic 



A conj A 
A 



Condition: no undischarged 
hypothesis in any conjunct 



The first deals with propositional formulae, the second with coordination. Now, the 
polymorphic verbs become and remained can simply be categorised as vp/ npvap. 
A coordination like wealthy and a Republican receives category npvap by using the 
weakening rule P twice. However, when we want to conjoin grew and remained 
using co, we need to strengthen the argument category of remained to be ap (cf. 
Fig. 2). Thus the ungrammatical double coordination (2) is correctly reject- 
ed. Recasting the analysis in terms of logic, the fact that the subcategorisation 
information acts as a premise of an implication — logically, functor categories 
can be seen as implications — is responsible why in a coordination of verbs this 



Rules are assumed to be in the format of Natural Deduction. 
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1 . 2 

findet [npAaccAdat] hilft [npAaccAdat] 



vp/npAacc npAacc vp/npAdat npAdat 

vp und t;» 

/il r 7 ; /i2 



vp/npAaccAdat conj vp/npAaccAdat Frauen 



vp/npAaccAdat npAaccAdat 

/e 

vp 

Fig. 3. The LCG analysis of (3c) 

Manner Kindern 



npAacc und npAdat 



findet und hilft npi\(accvdat) conj npi\(accvdat) 
vp I 'npAacc Adat npA(accvdat) 

Fig. 4. The LCG analysis showing blocking of (4) 



requirement gets strengthened (the category of the coordination becomes a com- 
mon weakening of all the conjoined verb categories). 

Interestingly, since subcategorisation requirements are encoded as 'feature infor- 
mation' that must be entailed by the respective complement, there is no need to 
view different grammatical cases like dative and accusative as mutually inconsis- 
tent information as in the consistency-based approach. For instance, the German 
verb helfen may require an object to have dative case without disallowing it to be 
accusative as well, in other words, helfen would be specified as vp/ npAdat. Actual- 
ly, this view of grammatical cases seems to be necessary to deal with the difference 
in grammaticality between (3c) and (4). 

(4) * Er findet und hilft Manner und Kindern. 

Frauen simply needs to be of type npAaccAdat and then the extended LCG of [3] 
correctly accepts (3c) (cf. Fig. 3). Also, (4) is rejected (see Fig. 4), since Manner 
und Kindern receives the weaker type npA(accvdat) , and hence does not match 
the combined requirement of findet und hilft. 



3 Lambek calculi with subtyping 



The general approach we take can be seen as adding Lambek slashes to some simple 
logical system, some base logic, for describing grammatical categories. However, 
seen from the outside the only effect of such base logics on the Lambek layer — 
so-to-speak the intercategorial layer — will be that they define orderings between 
'basic' categories, more precisely subtype orderings. For example, we might want 
to allow for basic category descriptions like the above npAaccAsg and npAacc (or 
if we want to employ feature logic [np & case: acch num: sg] and [np & case: acc]) 
together with the stipulation that whenever something is of the former type, it is 
also of the latter. So before looking at Lambek systems over arbitrary base logics, 
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we first give a rigorous treatment of the almost trivial extension to Lambek calculi 
where the set of basic types B is assumed to come with some subtype ordering 
The combination schema defined later on, in which a base logic for the description 
of categories is employed, will be an instance of Lambek systems of this kind. 
Specifically, in the case of feature term descriptions, the subtype ordering will be 
the well-known feature term subsumption relation. 

3.1 Syntax and semantics 

Assume as given a set of basic types B = {&i,&2>---} on which a preorder ■< 
(the subtype ordering) is defined, i.e. we require < C B X B to be reflexive and 
transitive. If b\ < 62, call b\ a subtype of 62 an d ^2 a supertype of b\. 

Definition 1 The set of formulae T of is defined inductively by: 

1. if b is in B, then b is in T; 

2. if A and B are in T , then A/B and B\A are in T . 

Definition 2 A (string-semantic) model for is a Kripke model (W, •, [•]), where 

1. W is a nonempty set; 

2. ■ :W xW 1— > W is an associative operation; and 

3. [•] : J- 1— > 2 W maps formulae to subsets of W and satisfies: 

a) [61] C [6 2 ] whenever b\ < b^; 

b) {A/B} = {x I Vy G IB} (x ■ y G {A})}; and 

c) IB\A} = {y I Vx G IB} (x-yelA})}. 

Let us in parallel to consider the logics resulting from extending with sub- 
typing the variations NL, LP and NLP of Lambek's calculus. Here N abbreviates 
"nonassociative" and P abbreviates "with permutation". We define models for 
NL^ [respectively LP^; respectively NLP^] as in Definition 2, but simply allow- 
ing (requiring) the second component • of models to be an arbitrary [respectively 
commutative and associative; respectively commutative] operation. 

3.2 Gentzen calculi 

There is a well-known common formulation of the proof system of all these Lambek 
logic variants in form of a Gentzen sequent calculus. The extension needed to cover 
subtyping is the same for all variants and almost trivial. 

In Gentzen calculi claims of the form U =>■ A are derived, which can be read 
as "formula A is derivable from the structured database U" , where a structured 
database (or G-term) is built from formulae using the single (binary) structural 
connective 0. 
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The denotation mappings [•] in our models are naturally extended to G-terms by 
denning (symbols U and V will henceforth always denote G-terms) 



{U © V} := {z G W I 3x, y G W (z = x ■ y A x G {U} A y G {V})}. 

A sequent in each of our logics is a pair (U,A), written U =>■ A, where U is a 
G-term and A 6 J. We call a sequent U A valid, if for all string models 
M = (W M , - M , B M ), {Uj M C {A} M . 

( Ax ) I T if 5 i ^ 5 2 

Oi ^> 2 



i/L) 



(\L) 



V ^ B U[A] => C 
U[A/B QV]^C 

V =>> B U[A] => C 
U[V B\A] => C 

V => A U[A] 



i/R) 



i\R) 



U Q B A 
U => A/B 

BQU=>A 
U => B\A 



(Cut) 



C 



U[V] => c 



Fig. 5. Gentzen calculus NL^ 



Now the logical rules for all variant systems are the same and shown in Fig. 5. 
The notation U[X] in rules /L, \L and Cut stands for a G-term U with a distin- 
guished occurrence of the (sub-) G-term X , and by using U\Y] in the same rule we 
denote the result of substituting Y for that occurrence X in ?7[X]. The variations 
between the four logics comes from their different use of the so-called structural 
rules Associativity and Permutation, by which the G-term connective may 
be forced to obey the appropriate combination of being associative or not and being 
commutative or not. However, we do not assume these rules explicitly, but rather, 
as it is standard, let them be implicit in the notion of G-term for the respective 
logic by taking G-terms modulo the respective combination of these rules. For 
instance, in L connective is assumed to be associative, which allows us to see 
G-terms simply as (nonempty) sequences (strings) of formulas with denoting 
concatenation. 

The only departure from standard Lambek calculus is the axiom scheme. It is the 
straightforward generalisation needed to generate all valid sequents of two basic 
formulae. 



3.3 Soundness and completeness 

Let us now check that our proof systems are indeed sound and complete. In the 
following completeness theorem the case of already follows from an argument 
given in [7], showing that the product-free Lambek calculus (i.e. where ■< is 
the identity) augmented by a set of additional axiom schemata R is sound and 
complete w.r.t. the class of string-semantic models satisfying R. The proof here is 
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a straightforward adaptation of this argument, however we give it for all Lambek 
variants simultaneously. 

Theorem 3 U => A is derivable in [resp. NL^, LP^, NLP^y iff U => A is 

valid (with the respective condition on the structural connective Q). 

Proof. The direction from left to right (soundness) is shown by the usual induc- 
tion on length of proofs. Validity of axiom instances directly follows from model 
condition (3a). For the other direction, we construct the canonical model CM as 
follows. Let CM = (W, 0, [•]) with (W, 0) being the free algebra generated by 
over the formulas (modulo the respective structural rules) and let [•] be defined as 

{A} := {u G W | h u => A}. 

Here h T stands for T is derivable in the respective logical system. Observe that 
due to the reflexivity of ■< all sequents derivable in the original Lambek calculus 
(resp. in its variants NL, LP, NLP) are also derivable here. 

1. CM is a model: we check the conditions (a) -(c) in the definition of [•]. 

a) Suppose b\ < b 2 . Then U G iff h U =>■ b\, but then with Cut on 
h b 2 , h U b 2 . 

b) We need to show for all U G W, A, B G T: 

hU A/B iff MV G W (h V B implies h U V A). 

So, suppose for arbitrary U, A, B, and V, h U =>■ A/B and h V =>■ B. 
The latter gives, using / L on A =>■ A, A/BQV =>■ A, and hence with Cw£, 
h (/©y =>■ A. For the converse assume U, A, B such that the condition of 
the right-hand side holds. With V = B we then get \- U Q B A, since 
\- B B is true. But then, by / R we derive the required U =>■ A/B. 

c) symmetrical 

2. CM invalidates underivable sequents: for suppose \/ U A, hence U ^ [A]. 
But clearly C7 G [?7] contradicting the validity of U A. 

This completes the proof. □ 

3.4 Some properties of the calculi 

It is helpful to classify the occurrences of subformulas in ^-formulas and sequents 
into positive (polarity = +1) and negative (polarity = —1) as follows. 

Definition 4 (positive/negative subformula occurrences) 

— A occurs positively in A. 

— If the polarity of B in A is p and B has the form C/D or D\C, then the 
polarity of that occurrence of C [resp. D] in A is p [resp. —p]. 
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Next we extend the subtyping relation ■< to complex types by stipulating 

if A < A' and B' < B then A/B < A'/B' and B\A < B'\A' . (1) 

Evidently, with this definition we remain faithful to the intended meaning of sub- 
typing, namely that it denotes the subset relation. 

Lemma 5 If A <B, then for any model l_AJ C [£?]. 

Using completeness we trivially obtain 

Theorem 6 If A < B, then \- A=$> B. 

The following two derived rules make explicit the monotonicity properties of deriv- 
ability with respect to subtyping on the left- and on the right-hand side. They are 
specialisations of Cut taking into account Theorem 6. Be warned that the notation 
U[A] stands for U with a G-term occurrence A (not an arbitrary subformula). 

(strenqthen L) — [ J ^ ^ if A -< B (weaken R) — — if A -< B 

v y ' U[A] => C ~ v ' U => B 

3.4.1 Cut elimination 

An important result is that the Cut rule is redundant, since from this decidability 
(in fact even A/'P-easyness) directly follows. Cut-free proofs have the nice property 
that the length of the proof is bounded linearly by the size of the sequent to be 
proven. So, let us in the following call NL^ the system comprising of the rules and 
axiom shown in Fig. 5, except for the Cut-rule. 



Theorem 7 (Cut Elimination) U =>■ A is derivable in NL^ iff it is derivable in 
(A proof is given in the appendix.) 



3.4.2 Context-freeness 

We show here that the addition of subtyping to the calculus of Lambek categorial 
grammars does not extend their generative capacity. A categorial grammar based 
on can always be compiled out into a (possibly much larger) Lambek categorial 
grammar (and hence, also into a context-free grammar). 

Assume as given a system = ( J 7 , ) and a finite alphabet T of lexical entities. 

Definition 8 An grammar G for T is a pair (a, S) consisting of the lexical 
assignment a : T 1— > 2^ , such that for any t £ T, a(t) is a finite set of types, and 
the distinguished (sentence) type S G T . 
The language generated by G is 

n 

L(G) := {h...t n e T* I 3B t ...B n (h^ B t . . . B n S A /\ B t G a(U))} 

8 = 1 

Theorem 9 (Context-freeness) For any grammar G the language L(G) is 
context-free. 

(A proof is given in the appendix.) 
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4 Layering Lambek over some base logic 

We now describe how the abstract placeholder of the subtype ordering is sensibly 
filled in in constraint-based Lambek grammar. As explained above we assume 
some specification language (or base logic) whose purpose it is to allow for more 
fine-grained descriptions of the basic categories. Before we proceed to describe a 
concrete choice of base logic, let us first consider abstractly what we demand of 
such a logic and how an integration with Lambek logic can be defined generally in 
a way exhibiting the desired properties. 

Formally, we assume a base logic BL consisting of: 

1. the denumerable set of formulae BT 

2. the class C of models b of the form (W\,, [•]''), where: 

a) W\, is some nonempty set 

b) [-]^ : BT i — y 2 W ^ maps formulae to subsets of W\, 

We will be interested in logical consequence between two single formulae, i.e., in 
the relation (f> \=qi ip Od e f Vb G C ([<^] b C {ipf) for all (f>,ip e ST. We also assume 
a sound and complete deduction system hg|_ for answering this question. Note 
that ^g|_ is a preorder. Now we define the layered Lambek logic L(BL) over some 
given base logic BL as follows. 

Definition 10 The logic L(BL) is the logic L|_ BL over basic types BT . 

This means, we let L(BL) simply be a subtyping Lambek logic L^ taking as < 
the consequence relation over BL-formulae. Thus we inherit the complete and 
decidable (provided ^g|_ is decidable) proof system of L^, but also its limited 
generative capacity. 

Note that every formula of L(BL) has an outer (possibly empty) Lambek part, 
which composes some BL-formulae with the connectives / and \, but no BL con- 
nective may scope over formulae containing / or \ (hence the term 'layered logic'). 

The reader will have noticed that Bayer and Johnson's logic presented in Section 2 
is a layered logic of this kind, where BL is instantiated to be propositional logic 
(restricted to connectives A and V). 

It may be a bit surprising that the generative capacity is not affected by the choice 
of the base logic. Even if we admit full standard first-order logic, we do not exceed 
context-free power. We see a cause of this in the fact that there is no direct 
interrelation between L(BL)-models and base models. Only on the abstract level 
of the consequence relation ^g|_ the notion of base model enters the conditions on 
L(BL)-models. 2 On the other hand L(BL) enjoys a particularly simple and modular 

2 As an alternative to our model conception, consider the option of defining an L(BL)-model 
in a such way that it contains a base model as a part. For instance, when given a base model b, 
we could stipulate 

M = Ut for all^eBT. 
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proof system. Proofs can be built by decomposing the Lambek part of a goal 
sequent with the usual rules until one reaches pure BL-sequents (Lambek axioms). 
Those are then proved in pure hg|_ by independent proofs for each sequent, i.e., 
the proof system hg|_ can be considered a black box. 

4.1 Feature logic as base logic 

We now consider the consequence of choosing feature logic as the base logic. For 
this variant of the logic to be complete we need a proof system for deciding <p> ^p|_ ip 
i.e. checking whether <p> entails ip. Proof systems for deciding entailment between 
feature constraints are well known (cf. [2] [1]). 

We provide a simple entailment checking for a restricted feature logic with the 
following syntax: 

(f>, ip — > x variable 




Since the semantics of feature logic is well known, we do not provide a model 
theoretic semantics here (cf. [19] for details). 

As a first step in determining the entailment <p> \=fi ip we translate the constraints 
x = <p and x = ip into normal form by employing normalisation procedure for 
feature logic (see [19]). A normal form translation of the constraint x = <p results in 
a constraint of the form 3xq . . . x n <p' where <p' is a conjunction of simple constraints. 
In our case, these are of the form : x = a, x = T, x = _L, x = / : y. Thus, normal 
form translation of x = {cat : np & case : acc) gives 3y^yi{x = cat : yo & yo = 
np & x = case : y\ & y\ = acc). The normal form translation, apart from 
providing a simplified conjunction of constraints, also decides consistency of the 
initial formula. 

Now we describe a simple method for deciding <p Nfl V' m the general case where 
both <p an d "P are consistent. Let 3xo...x n <p> (resp. 3yo...y m ip 1 ) denote the 
normal form translation of x = <p> (resp. x = ip). For simplicity of presentation, 
we assume that the existentially quantified variables xq. . .x n and yo ■ ■ .y m are 
disjoint. Since the existential quantification in 3xq . . .x n <p>' is not needed in the 
simplification procedure, we remove this from consideration. We then apply the 
entailment checking procedure given in Figure 6 to <p>' \=^\_ 3yo . . . y m ip' calling <p>' 
the context and 3yo . . . y m ip' the guard. 

instead of condition (a) on [•] (assume W\, C W). (In our model conception such models are 
allowed, but not required). A consequence of this option is that certain logical behaviour of 
BL-formulae may get exported to the Lambek level leading to incompleteness there. E.g., when 
BL contains a constant (or formula), say FALSE, denoting the empty set, any sequent of the form 
[/[false] A will be valid. 




a atom 

T top 

_L bottom 

/ : (f> feature term 

3x{(p>) existentially quantified variable 

(f> & ip conjunction 
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(SAtom) 



x = a k 4> ^ FL 3y ...y n ip 



x = a k (f> ^p|_ 3yo . . . y n x = a k ip 



(SFeat) 



x = f :y k (f> ^ FL 3y ...y n ip 



x = f :y k (f> ^ FL 3y . . .y n x = f : y k ip 



(SFeatExist) 



x = f :y k (f> ^FL 3y . . .y t . . . y n [y/ y t ]ip 



x = f :y k (f> ^FL 3y . . .y t . . .y n x = f : y t k ip 



Fig. 6. Entailment checking in simple feature logic 



The rules given in Figure 6 are to be read from bottom to top. These rules 
simplify the guard with respect to the context. The rules SAtom and SFeat are 
self-explanatory. In rule SFeatExist the notation [y/yi\ip means replacing every 
occurrence of y 8 - with y in ip. Once the entailment checking rules terminate, entail- 
ment and disentailment can be decided by inspection. 

— <p ^fl V' if entailment checking of <p' \=^\_ 3yo . . . y m ip' simplifies ip' to a 
possibly empty conjunction of formula of the form ij{ = T. 

— (f> ^=FL ^ {i- e - 4 1 disentails ip) if entailment checking of <p>' \=fi 3yo . . .y m ip' 
simplifies to the form x = a k <p' \=^\_ 3yo . . . y m x = r k ip' [or the form 
x = t k <p' \=^\_ 3yo . . . y m x = a k tp'] where r is one of: 



— (f> \=fi ip is blocked if neither of the above two conditions hold. 

This completes the basic building blocks needed to implement a proof system for 
a Lambek calculus with feature logic as the base logic. 



The initial motivation for conducting this research was the lack of a model-theoretic 
semantics for unification-based versions of categorial grammar, standing in con- 
trast to the apparently clear intuition of what categorial types over feature terms 
should mean. We were guided by the insight (or basic assumption) that a func- 
tor type A/B, according to the traditional semantics, may always be applied to 
subtypes of B (yielding an A), but not necessarily to supertypes. Combined 
with the idea that feature terms as basic types essentially provide a means to 
express fine-structuring of types, i.e., subtyping, this led us to devise a simple 
model theory embodying just those assumptions, which however is accompanied 
with a subsumption-based (though equally simple) proof system. This work in 



• b with a, b distinct 



5 Discussion 
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a sense complements work by Dorre, Konig and Gabbay [4], in which a model- 
theoretic counterpart of the unification-based proof system is constructed using 
the paradigm of fibred semantics (cf. [8]). 

One important issue for grammar logics upon which we have remained silent up 
to now is semantics construction. There was no need to address it so far, because 
our logic is completely neutral in that respect and does not involve any commit- 
ment to one of the two familiar approaches to construct semantics. We either can 
please the categorial grammar purist and use the Curry-Howard (-van Benthem) 
correspondence to view rules as recipes to cook up lambda terms (cf. [22]). Or 
we may contend aficionados of the HPSG way and employ an additional layer in 
which semantic formulae are built up by unification, as we will see below. 

In a Lambek- van Benthem system, as is well-known, each type is paired with 
a semantic formula and L-rules trigger function application and i?-rules lambda 
abstraction on these formulae. Since this does in no way interfere with our exten- 
sion, we can apply this method unchanged. 

On the other hand, if we were to adopt a HPSG style semantics, then variable 
sharing across categories is needed, i.e., we cannot come by simply adding to 
the feature structures of basic types a SEM feature, since no information can be 
'percolated' out of local trees. What we propose here is to use the combination 
scheme of [4] and combine L(FL) with another layer of feature constraints. In that 
second feature-logical layer, however, unification is employed to match categories. 
We describe here in short how this construction works. 

Double Layering To each basic type in an L(FL) formula associate a new variable. 
On these variables we then can impose (feature) constraints noted in a top-level 
conjunction as a third component of a sequent. E.g., an HPSG-like lexical assign- 
ment for the control verb persuade would be of type (unification-layer variables 
written as superscripts) 

([cat: np] x \ [cat:s] s ) / [cat: np] Y / ([cat:np] Y \ [cat:s&vform:inf] z ), 

together with constraints on the variables X, S, Y, Z. For instance, we might 
require that S is constrained to be a persuading relation as given by: 

RELATION persuadel 
INFLUENCE X 
INFLUENCED Y 
SOA-ARG Z J 

A proof proceeds as in the system L(FL), but additionally maintains as a global 
environment a feature constraint <I>, initially the conjoined feature constraints of 
all formulae in the goal sequent. Whenever we apply the axiom schema on some 
[bi] x =^ [62]^, we a( id X = Y to <I>, normalize, and continue if the result is consis- 
tent. This simply means, we unify the feature graphs (encoded as the constraints) 
of X and Y. Thus the composition of the content structure proceeds in exactly 
the same way as in HPSG (or other comparable unification-based frameworks). 



S 
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The important point here to note is that unification-based and subsumption-based 
argument binding can be combined into a single logic 3 and complement each other. 
Speaking in HPSG terms, we have the flexibility to choose which parts of a sign 
we want only to be matched and which parts to be unified, when it is combined 
with others in a local tree. 4 

On the processing side we believe that the separation of a subsumption-based 
layer (of context-free power) and a unification-based layer offer similar benefits 
like the distinction between c-structure and f-structure in LFG. For instance, we 
can easily precompile the formulas of the subsumption layer into a type hierarchy 
of atomic symbols and thus may be able to employ efficient indexing techniques 
during parsing. 

A final point we want to make concerns other extensions to the original Lambek 
calculus which appear to be prerequisites for many linguistically interesting anal- 
yses. By that we mean for example Moortgat's non-directional slash operator 
allowing for non-peripheral gaps, his operator f|~ for generalised quantifiers, Mor- 
rill's multimodal and discontinuity operators, structural modalities etc. (cf. [13]). 
We believe that our extension of subtyping is well compatible with (at least most 
of) these additional devices and offers an orthogonal extension to these. 

6 Conclusion 

We have shown that a simple and happy marriage between constraint-based gram- 
mars and categorial grammars is technically feasible with an appealingly simple 
model theory. Our hybrid grammar logic permits extant categorial proof systems 
to be carried over in the new system. Furthermore, the logic is parameterised over 
arbitrary (constraint) logics as long as a reasoning mechanism for determining 

3 in much the same way as we can add a unification component for feature structures to a 
context-free grammar to obtain LFG. 

The reader might wonder whether there is a simpler way to allow for information percolation 
through variable binding. Suppose we would use the strategy "after having checked that the 
category serving as actual argument (the complement sign) is subsumed by the functor type's 
argument description (the respective slot of the subcat list of the head), just unify the two". 
Hence, variables in the subsuming type would be bound, possibly carrying that information 
to other parts of the functor type's structure. For instance, we could have a modifier type 
3x[cat: x / cat: x] or a coordination type 3x[cat: x\cat: x / cat: x], the result type of which would 
depend on the type(s) of its argument(s). But consider what happens, when we apply that 
coordination type to two arguments of different types: 

cat: (npvap) 3x[cat: x\cat: x/cat: x] cat: np 

Using I L and \L this reduces to cat: x plus the two sequents cat: np =>• cat: x and cat: (npvap) =>• 
cat:x. Now, if we choose to first prove the first, x gets bound (globally) to np and the second 
sequent fails, but choosing the other order x = (npvap) and the proof succeeds. This means 
that our naive proof procedure is sensitive to the order of rule application. In other words, to 
guarantee the completeness we would have to search for a particular sequence of rule ordering. 
This is an undesirable situation that we want to avoid, since it will result in a vastly inefficient 
proof procedure. 

In addition there is a semantic problem namely that it becomes rather difficult to provide a 
sensible semantics to categories B\A and A/B. In particular, what we witnessed in the example 
above is that the denotation of A in B\A (or A/B) is going to be dynamic and contingent on 
what A actually unifies with. 
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entailment and consistency is provided. We believe that crucial to the success of 
this approach is the novel use of subsumption (or entailment) checking as opposed 
to just unification. 

Appendix 
Proofs of Theorems of Section 3.4 

Cut Elimination 

The following lemma considers derivability of a special case of each of the two 
derived rules above in NL^, laying the seed for the Cut elimination proof. It is 
stated as well as the Cut elimination theorem with respect to derivability in NL^, 
the weakest of the four systems, but it should be kept in mind that these facts 
about derivability hold a fortiori in the other systems as well. 

Lemma 11 If U\bi\ =>■ C [respectively U =>■ b\] is derivable in NL^ and b\ < bi, 
then U[bi] =>■ C [respectively U =>■ 62 _/ is derivable in NL^. 

Proof. By induction on the length of the proof of L = U\b2\ =>■ C. 

n = 0: Then U = 62 an d C = 63 G BT [resp. U = 63]. By transitivity of hg|_, 
bi 63 (= U[bi] =>■ C) [resp. U =>■ b\] is an axiom. 

n > 0: We distinguish cases according to the last rule used in the proof of T and 
state the bracketed cases separately: 

/ R: Then C = A/B and there is a proof of (?7[&2]> B) =>■ A of smaller length. 
Hence by induction hypothesis (IH) we get (U[bi], B) =>■ A and by /i? 
the required sequent. 

[/ R :] cannot be last step 

/ L: U contains distinguished occurrences of the two G-terms 62 an d (A/B, V). 
Hence, either 62 occurs in V, in which case by IH on V =>■ B the claim is 
shown, or U = f/[(A/_B, y), 62] 5 *- e - ^2 occurs also in U[A] as a G-term 
different from A. In this case the claim follows via IH on U[A, 62] =>■ C- 

[/Zr :] obvious, since we get by IH U[A] =>■ 62- 

\L, [\R], and are completely analoguous. □ 

Proof of the Cut Elimination Theorem. We point out what needs to be 
changed in the Cut elimination proof for standard Lambek calculus (cf., e.g., [5]). 
Let the degree of an application of Cut be the number of occurrences of connectives 
in the cut-formula A, the formula eliminated by applying Cut. Then the standard 
argument goes by showing that whenever a sequent L has a proof which contains 
exactly one application of Cut, which is of degree d, and this is the last step in 
that proof, i.e. the proof has the form 
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(Cut) 

then by a case analysis of the two steps introducing T\ and T2, it follows that T 
is derivable from the premises of those steps involving either no Cut or only Cut 
applications of lower degree. The arguments for all the cases carry over identically 
to our system except for the case where one of T's premises is an axiom. So, 
suppose I 1 ! = &i =>■ 62 with 62 being the Cw£-formula. Then ^ = £^[62] =>■ C, 
and whence by Lemma 11, T = U[b\] C has a cut-free proof. But also if 
T 2 = &!=>■ 62 with b\ being cut, we get Y\ = V =>■ b\ and again by Lemma 11 
the claim holds. □ 

CONTEXT-FREENESS 

The following charaterisation of which is a simple consequence of its definition 
for complex types, will be useful in proving the context-freeness theorem. 

Lemma 12 A ■< B iff B is the result of substituting in A or more negative 
subformulae occurrences 61, . . . , b n of basic types by subtypes and or more positive 
subformulae occurrences b[, . . . , b' m by supertypes. 

Proof of the Context-freeness Theorem. We show how to construct for an 
arbitrary grammar G = (a, S) a finite set of (pure) Lambek grammars such that 
L(G) is the union of the languages generated by these. Since Lambek grammars 
generate only context-free languages [14] and context-free languages are closed 
under union, L(G) is context-free. 

Let B\ a be the (finite) subset of B of basic formulae occurring (as subformulae) 
in some a(ti). Call for arbitrary A £ T super + (A) (resp. super~ (A)) the set of 
formulae A' such that A' is the result of replacing in A or more positive (resp. 
negative) subformula occurrences b\, . . . , b n of basic types by respective supertypes 
from B\ a . For instance, if B\ a = {61,62} where b\ -< 62 then super + (61/(61/61)) = 
{61/(61/61), 61/(61/62), 62/(61/61), 62/(61/62)}. We now define 

S := super' (S) = {St, . . . , S m } 
a(t) := {B' \ 3B B e a(t),B' £ super + (B)} 

The Lambek grammars seeked for are G{ = (a, Si) (over the Lambek type system 
Lid) for i = 1, ■ ■ ■ ,m. We let L stand for the union of their languages and show: 

L C L(G): Suppose w = ti,...,t n £ L. Then there exists a j such that w is 
generated by Gj, and hence there are B[, . . . , B' n such that B[ £ a(ti) and l~|_ Jd 
B[ . . .B' n =^ Sj. But then there are B\, . . . , B n such that B[ £ super + (Bi) 
and Bi £ a(ti). Since valid derivations in L/^ remain valid in we get 
h|_^ B[...B' n =^ Sj and then with (strengthen L) and (weaken R) (due to 

Bi ■< B[ and Sj ■< S, cf. Lemma 12) also h|_ B\ . . .B n =^ S, i.e., w £ L(G). 
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L(G) C L: Suppose w G L(G), i.e., there are Bi,...,B n such that h|_^ 

B\...B n =>■ S* and _B 8 - G cn{ti). Assume A is a proof of this sequent in 
and relies on the axiom instances b\ =>■ b[, . . . , bj~ =>■ 6^ (i.e. 6 4 - ^ 6'). If 
we replace these instances by b[ b[, . . . , b' k b' k , but keep the rest of the 
proof structure, we obtain an L/^ proof of a sequent B[ . . . B' n =>■ S', where 
B[ G super + (Bi) for i = 1, . . . , n and S" G super~ (S) (note that the left-hand 
sides of axioms appear as subformulae in positive occurrences on left-hand 
sides or negative occurrences on right-hand sides of derived sequents). Hence 
B[ G ct(ti) for all 1 < i < n and S' = Sj for some 1 < j ' < m, implying that 
w is generated by Gj. □ 
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